3D Shape Induction from 2D Views of Multiple Objects

نویسندگان

  • Matheus Gadelha
  • Subhransu Maji
  • Rui Wang
چکیده

In this paper we investigate the problem of inducing a distribution over three-dimensional structures given twodimensional views of multiple objects taken from unknown viewpoints. Our approach called “projective generative adversarial networks” (PrGANs) trains a deep generative model of 3D shapes whose projections match the distributions of the input 2D views. The addition of a projection module allows us to infer the underlying 3D shape distribution without using any 3D, viewpoint information, or annotation during the learning phase. We show that our approach produces 3D shapes of comparable quality to GANs trained on 3D data for a number of shape categories including chairs, airplanes, and cars. Experiments also show that the disentangled representation of 2D shapes into geometry and viewpoint leads to a good generative model of 2D shapes. The key advantage is that our model allows us to predict 3D, viewpoint, and generate novel views from an input image in a completely unsupervised manner.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Approach for Quantitative Evaluation of Reconstruction Algorithms in SPECT

ABTRACT Background: In nuclear medicine, phantoms are mainly used to evaluate the overall performance of the imaging systems and practically there is no phantom exclusively designed for the evaluation of the software performance.  In this study the Hoffman brain phantom was used for quantitative evaluation of reconstruction techniques. The phantom is modified to acquire t...

متن کامل

Feature based 3D Object Recognition using Artificial Neural Networks

The recognition of objects is one of the main goals for computer vision research. This paper formulates and solves the problem of three-dimensional (3D) object recognition for Polyhedral objects. A multiple view of 2D intensity images are taken from multiple cameras and used to model the 3D objects. The proposed methodology is based on extracting set of features from the 2D images which include...

متن کامل

A relaxation algorithm for real-time multiple view 3D-tracking

In this paper we address the problem of reliable real-time 3D-tracking of multiple objects which are observed in multiple wide-baseline camera views. Establishing the spatio-temporal correspondence is a problem with combinatorial complexity in the number of objects and views. In addition vision based tracking suffers from the ambiguities introduced by occlusion, clutter and irregular 3D motion....

متن کامل

Seeing Glassware: from Edge Detection to Pose Estimation and Shape Recovery

Perception of transparent objects has been an open challenge in robotics despite advances in sensors and datadriven learning approaches. In this paper, we introduce a new approach that combines recent advances in learnt object detectors with perceptual grouping in 2D, and projective geometry of apparent contours in 3D. We train a state of the art structured edge detector on an annotated set of ...

متن کامل

Unsupervised learning through one-shot image-based shape reconstruction

Objects are three-dimensional entities, but visual observations are largely 2D. Inferring 3D properties from individual 2D views is thus a generically useful skill that is critical to object perception. We ask the question: can we learn useful image representations by explicitly training a system to infer 3D shape from 2D views? The few prior attempts at single view 3D reconstruction all target...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1612.05872  شماره 

صفحات  -

تاریخ انتشار 2016